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Abstract 

PQ ' The main aim of this paper is to develop a chent/server based model for computing the weighted 

\ Moore-Penrose inverse using the partitioning method as well as for storage of generated results. 

• . The web application is developed in the PHP / MySQL environment. The source code is open and 

' free for testing by using a Web browser. Influence of different matrix representations and storage 

systems on the computational time is investigated. The CPU time for searching the previously 
stored pseudo-inverses is compared with the CPU time spent for new computation of the same 

> ■ 

00 \ AMS Subj. Class.: I5A09, 68P15, 68N15. 

j^l^ \ Key words: PHP script, MySQL database, Weighted Moore-Penrose inverse. Matrix storage formats. 
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1 Introduction 

Since the mid-1990s, there has been a surge of interest among the academics and practitioners in an 
open source software (OSS). There are many successful projects in OSS community, primarily the 

I , Mozilla web browser, the Linux operating system, the Apache web server, and to a lesser extent, 

^ ■ PHP [12] and the Perl programming languages, as well as the MySQL database. OSS has drawn the 

^ I attention of users and developers because of its economic benefits [T^ . 

PHP is an open source software, and it is free for downloading and use. Main characteristics of 
PHP are described in [12) . 

MySQL has become the world's most popular open source database system because of its consistent 
fast performance, high reliability and ease of usage. Besides the fact that it is free, MySQL offers a 
wide range of possibilities [SJ dZ] ■ 

SQL is the standard language used for querying and analysis of data in a relational DBMS [4] . Un- 
fortunately, SQL has no vector and matrix computations. Interesting code for matrix SQL operations 
can be found in [16] . The SQL constructions and the SQL primitives for data mining are proposed in 
[1]. However, these constructions do not offer adequate and flexible tools for matrix manipulation. A 
graphical SQL query generator and query operators with embedded matrix objects are disclosed in [3] . 

Implementation of some vector and matrix operations based on programming User-Defined Functions 
{UDFs) is studied in [15]. UDFs represent a C programming interface that allows the definition of 
scalar and aggregate functions that can be used in SQL. U DFs have several advantages and limitations. 
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An U DF allows fast evaluation of arithmetic expressions, memory manipulation, using multidimensional 
arrays and exploiting all C language control statements. 

The organization of the paper is as follows. Motivation of the paper is described in the second section. 
In the third section, we describe application details. In the fourth section, we develop data storage and 
computing system, which manipulate with different types of matrices, based on the PHP/MySQL 
environment. Functions and procedures are hardwired to a common matrix table. Influence of different 
matrix representations in conjunction with storage systems to performances of the implemented software 
is considered through numerical experiments. All test matrices and the results can be stored in files 
and the database on the server-side. A few illustrative examples and comparative studies are presented 
in the last section. 

2 Preliminaries and motivation 

Some mathematical formulations and motivation are involved in this section. Let C be the set of complex 
numbers, C™^" be the set of m x n complex matrices, and C"'-^" = {X e C™^" : rank(X) = r}. For 
any matrix A G cmx" and positive definite matrices M and TV of the orders m and n respectively, 
consider the following equations in X, where * denotes conjugate and transpose: 

(1) AXA = A (2) XAX = X 

(3M) (MAX)* = MAX {AN) {NXA)*=NXA. 

The matrix X satisfying (1), (2), (3M) and (4N) is called the weighted Moore-Penrose inverse of A, 
and it is denoted hy X = A\jj^. In the particular case M = 1^ and N = 1^ the matrix X = A\jj^ 
comes to the Moore-Penrose inverse of A, and it is denoted hy X = A'' . 

The Greville's partitioning method for numerical computation of generalized inverses is introduced 
in [5]. The following computational experience is delivered in "when applied to a square, fully 
populated, non-symmetric case, with independent columns, the Greville's algorithm found that the 
approach can be up to eight times faster than the conventional approach of using the SVD; rectangular 
cases are shown to yield similar levels of a speed increase" . Due to its computational dominance, this 
method has been extensively applied in many mathematical areas, such as statistical inference, filtering 
theory, linear estimation theory, optimization and more recently analytical dynamics [5S] (see also [5]). 
An application of the partitioning method in a direct approach for computing the gradient of the pseudo- 
inverse is presented in . It has also found wide applications in the database and the neural network 
computation [13]. In the paper [7], the sequential determination of the Moore-Penrose generalized 
inverse matrix by dynamic programming is applied to the diagnostic classification of electromyography 
signals. 

The Greville's partitioning algorithm is extended in many papers. Wang in |26j generalizes Gre- 
ville's method to the weighted Moore-Penrose inverse. The algorithm for computing the Moore-Penrose 
inverse of one- variable polynomial and/or rational matrices, based on the Greville's partitioning algo- 
rithm, is established in [19]. An extension of the results from [T^ to the set of two- variable rational 
and polynomial matrices is introduced in the paper [18j . In the paper [22| we extended the Wang's 
partitioning method from [26] to the set of one- variable rational and polynomial matrices. An efficient 
implementation of the algorithm introduced in |22j is established in |17| . An implementation of the 
algorithm introduced in [^l] for computing generalized inverses is based on the LU factorization of the 
matrix product. 

For the sake of completeness, we restate the algorithm introduced in [26], applicable to rational and 
constant matrices. The algorithm is quite appropriate for computation of the weighted Moore-Penrose 
inverse as well as the Moore-Penrose inverse and regular inverse. 
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Algorithm 2.1 (G.R. Wang, Y.L. Chen.) Computing the weighted M-P inverse A\j j^. 

Require: Let A G M™^", M and N be p.d. matrices of the order m and n respectively. 
1: Ai = ai. 
2: if ai = 0, then 
3: Xi = {alMai)-^alM; 
4: else 

5: Xi = 0. 

6: end if 

7: for fc = 2 to n do 

8: dk=Xk-iak, Ck = afc - Ak^idk, 

9: if Cfc 7^ 0, then 

K = {clMcky^clM, goto Step^ 

11: else 

12: 5k = ukk + dlNk-idk - {dlh + l*kdk) - 11(1 - Xk-iAk-i)N^l^lk, 

13: hl = 6l\dlNk-i-ll)Xk-u 

14: end if 

y _ \ Xk-i - {dk + Xk-iAk-i)N-\lk)hl ' 
16: end for 

17: return A\^j^ = X„. 

Comparison of searching in large databases with reconiputation in recursive algorithms is one of 
the questions on which we will focus in this article. The partitioning method is chosen in our article 
because of its recursive structure. Substantial effort is dedicated to a tight coupling of database and 
matrix computations, particularly in the inversion problem. Our idea is to memorize all input and 
resulting matrices in database tables, and as we need some of them, simply find results in appropriate 
table. Obviously, it is more efficient to find results on the server rather than perform recomputation, 
in the case when input matrices and requested operations are stored on the server side. This approach 
skips the computational logic and reduces the CPU time. We discuss the implementation of Algorithm 
12. li on a numerical DBMS and present experimental results demonstrating the performance benefit. 

Many scientists ignore the fact that most of data that are subject to analysis has been already 
exploited and can be stored in database systems, as well as that database systems provide powerful 
mechanisms for accessing, filtering and indexing data. We develop an application which memorizes all 
input matrices and results of performed matrix computations. The main challenge is at first to identify 
appropriate matrix representations and secondly to implement them using numerical DBMS extension 
facilities, e.g. storing, indexing, searching, etc. To achieve our aim, we implemented and tested different 
methodologies for storing matrices in the database and systems of the matrix representation (application 
logic) in the three-tier architecture. 

We develop an algorithm for storing every input matrix and results of computations by a database 
system. The algorithm is applied in order to develop the client/server Web application for computing 
the Moore-Penrose inverse, the weighted Moore-Penrose inverse as well as the fundamental matrix 
operations. Based on the analysis of available algorithms and our previous work in this area (see 
[TTI [T8l fT9l [20l [22l [2TJ [23l [24]), we have identified operations, which are useful for a number of similar 
algorithms. 

The following figure illustrates our idea. We compare the input matrices (entered in the web forms) 
and required matrix operations with matrices and operations in the database table. If these matrices 
and required operations exist in a database table, the computational logic is determined by the steps 
(1, 2, 3, 4, 5). Otherwise, the computational logic follows the path (1, 2, 3, 6, 7, 5), as it is illustrated on 
Figure [TJ 
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Figure 1: A computing machine on the middle tier 



An extension of DBMS with fundamental vector and matrix operators supported by programming 
in the PHP/MySQL environment is studied. We experimentally compare the influence of different 
matrix representations and storage systems in computation of the weighted Moore-Penrose inverse using 
client-server architecture and SQL, with respect to performance, ease of use, flexibility and scalability. 
The PHP scripting language is used as our middle-tier scripting language in the three-tier architecture 
model of the web database application. 

The research questions, we should answer in this article are the following: 

- Can PHP/M ySQL environments help in writing common matrix operations aimed for computing 
the generalized inverse? 

- Can we take advantages of the PHP language to implement vector and matrix operations in a 
database? 

- How different matrix representations, implemented using client-server architecture and SQL, en- 
hances to performance, ease of use, flexibility and scalability of algorithms for computing generalized 
inverses? 

In the present article we, continue the idea used in [20| . where the matrix database storage is used 
in the pseudo-inverse computation. Besides the routines which compute the weighted Moore-Penrose 
inverse, we develop a set of routines for implementation of the matrix library. In this way, we also 
continue results from [5] , where a dynamic parallel matrix library is introduced. Our fundamental matrix 
operations include addition, subtraction, multiplication of two matrices, computation of determinants, 
as well as the pseudo-inverse and the weighted pseudo-inverse computations. All these operations are 
considered for both dense and sparse matrices, with a possibility of expansion on diagonal, triangular 
and symmetric sparse matrices. We also overcome results from the paper [14], where a set of FORTRAN 
subroutines for testing computer programs aimed for computing the generalized inverse is presented. 
Usage of the programming language PHP overcomes bounds of the FORTRAN and C subroutines from 
[l4] and [8] in the following possibilities: 

1. Usage of the Web oriented programming paradigm available in PHP in conjunction with HTML 
and XML; 

2. Usage of the object oriented programming (OOP); 

3. Possibility to use the data storage system available in MySQL database system; 

4. Usage of the Apache Web server. 

Therefore, the main idea is to provide an appropriate client-server application, in the free open 
source PHP/MySQL development environment, utilizing the minimum of resources: Internet browser 
and the operating system. 
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3 Application details 

Database developer SQL and SQL Server does not support direct operations on matrices. However, 
because tables and matrices share the same structures, SQL allows easy manipulations with matrices. 
The present article demonstrates a few SQL techniques for performing basic matrix operations. As we 
have mentioned, our idea is to implement a client /server application supported by the database usage, 
in order to find a solution which is already placed in the database. Our application makes possible 
operations with one, two and/or three arguments. The application is a combination of PHP/MySql 
elements and implements special database operations, which support the SQL software implementation 
of a wide range of algorithms. 

Figure [21 shows how the application works. 
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Figure 2: The matrix computation system 



Each input matrix is, firstly, transformed into a string, with the elements separated by commas. 
Then the generated strings, subject to the required operations, are compared with data stored in the 
database. If the searching criteria are successfully ended, then a recomputation is not needed, and 
result will be displayed immediately in an appropriate web form. 

The possibilities offered by the application are the following: 

- unary operations: unary matrix operations, where only a single matrix and an optional coefficient 
is used to produce a unique result. rA, A^,...); 

- binary operations: binary matrix operations, involving two matrices {A -\- y. B'^,...); 

- ternary operations: operations involving three matrices, such as the computation of the weighted 
Moorc-Pcnrose inverse A\jj^. 

User can define input matrices in two ways: 

- by element- wise entering values for elements of the matrices; 

- by uploading txt file, which contains matrix entries, under the assumption that matrix rows match 
with lines of the file and matrix elements are separated with the blank character. 

For this approach has been developed suitable Web interface illustrated in Figure [31 
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Chase an opBon from lutyHenu and ddi mnilitiel ^^^^^^g 

Figure 3: Web interface of the application 

How docs the appHcation work? General algorithm is defined globally based on the next three steps. 

The first step. The user chooses a matrix operation that will be implemented, via web browsers. 

The second step. Select the type of matrix that will be proceeded and define entries. There are three 
types of matrices for processing: dense, sparse or test matrices. More, it is possible to define entries of 
the input matrix in one of the next three ways, as it is illustrated on Figure HI 

- loading the matrix from an existing textual file, using the button Browse. Matrix is stored in 
memory as txt file, assuming that the elements of the matrix are separated with a comma, and the 
matrix rows are separated by the new line. 

- entering the number of rows, columns and later values of the elements. 

- entering a complete matrix in the text area box, assuming that its entries are separated by the 
blank character. 

The third step: Generate and preview the solution. 
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Figure 4: An example how to input matrix elements 



All the once handled matrices, chosen operations and the generated results of processing are stored 
on the server in the appropriate form in txt files. In this way, it is possible to avoid recomputations and 
download solutions of previously performed operations. Users are also allowed to use ready-made test 
samples. 



4 The Storage system 

The matrices and their storage representations implemented in the paper can be classified in the fol- 
lowing two groups. 

Randomly generated and test matrices arc stored in the database in two different storage 
systems. The row matrix format [R format) assumes that all matrix elements are placed in a vector, 
so that the matrix is represented as a string containing values of the matrix elements separated by 
the comma. On the other hand, the mR format represents a matrix in a database table under the 
assumption that the number of records in the table is equal to the number of rows of the matrix. 

Sparse unstructured matrices arc matrices whose entries are equal to zero for the most part and 
possess a distribution of the nonzero entries which do not match any specific pattern. There are various 
storage schemes, which minimize the memory space and computational requirements, by storing and 
performing arithmetic with only the nonzero elements. The simplest sparse matrix storage structure 
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is the Coordinate Format (COO format) [2], where the matrix is stored in three appropriate vectors, 
which represent the underlying sparse structure. The first vector (resp. the second vector) stores the 
row indices (resp. the column indices) for all non-zero entries. The third vector stores non-zero entries 
of the sparse matrix. 

The SQL codes for creating the data structure and tables on MySQL database server are described 
as follows. The CREATE TABLE steLtement has three parts: 

- A table name succeeds the CREATE TABLE statement. 

- A list containing attribute names, types, and modifiers succeeds are placed between the parenthesis. 

- A key list follows after the attribute list between the parenthesis. 

In our implementation, we use only two different names for tables, namely matrices Jn and matrices-Out; 

# Table structure for table ' matrices _in ' 
CREATE TABLE matrices.in ( 

id. in int(ll) NOT NULL auto.increment , 
elements.in longtext NOT NULL, 
dimension tinytext NOT NULL, 
test tinytext NOT NULL, 
sparse tinytext NOT NULL, 
PRIMARY KEY (id. in) 
) TYPE=MyISAM; 

# Table structure for table ' matrices. out ' 
CREATE TABLE matrices.out ( 

id. out int(ll) NOT NULL auto. increment , 
elements. out longtext NOT NULL, 
operation tinytext NOT NULL, 
matrix. I int{ll) NOT NULL default '0', 
matrix. II int(ll) NOT NULL default '0', 
matrix. Ill int(ll) NOT NULL default '0', 
r tinyint(4) NOT NULL default '0', 
s tinyint(4) NOT NULL default '0', 
p tinyint(4) NOT NULL default '0', 
q tinyint(4) NOT NULL default '0', 
PRIMARY KEY ( id.out ) 
) TYPE=MyISAM; 

The fields used in the database tables are described as follows. 

- idjin or id-Out: identification number, defined as auto -increment; 

- elementsJn or elements-Out: string of type longtext, containing matrix entries separated with 
the comma character. 

- dimension: string of the form 'm x n' which contains dimensions m and n of the matrix; 

- test: string representing the name of a test matrix from |28] . or the empty string (if the input 
matrix is not a test matrix); 

- sparse: flag of type tinytext, possessing the value '0' if the matrix is non-sparse or one of the 
values, '1', '2', '3' for a sparse matrix; this field is important for searching the matrices already stored 
in the database; 

- operation: It defines operations on input matrices (A + B, rA -f sB, A^^ + B^^ ...) or defines the 
computation of the weighted Moore- Penrose inverse; 

- matrix-I, matrix-II and matrix-IIT. contain IDs from the table matricesjin or default value 
'0' in order to find a solution for the chosen operation and entered matrices; 

- s, p and q: optional coefficients used to determine matrix operations rA, sB, A^ and i?^, 
respectively. 

It is clear that dimensions of matrices and vectors used as arguments are limited by the maximal 
length of the type longtext, which is within the interval [0, 2^^ — 1]; 

Example 4.1. In the present example matrix _A defines the test matrix A from [28] in the particular 
case a = 1, matrix_B is randomly generated and the matrix jC is sparse and unstructured. 



Computation of the generalized inverses using Php/MySql environment 



9 



matrix_A ■ 



11 


10 


9 


8 


7 


6 


5 


4 


3 


2 - 


10 


10 


9 


8 


7 


6 


5 


4 


3 


2 


9 


9 


9 


8 


7 


6 


5 


4 


3 


2 


8 


8 


8 


8 


7 


6 


5 


4 


3 


2 


7 


7 


7 


7 


7 


6 


5 


4 


3 


2 


6 


6 


6 


6 


6 


6 


5 


4 


3 


2 


5 


5 


5 


5 


5 


5 


5 


4 


3 


2 


4 


4 


4 


4 


4 


4 


4 


4 


3 


2 


3 


3 


3 


3 


3 


3 


3 


3 


2 


1 


2 


2 


2 


2 


2 


2 


2 


2 


1 





1 


1 


1 


1 


1 


1 


1 


1 





-1 



matrix^B - 



matrix_C -■ 





282 




-11 




206 




-39 




84 




94 




-11 




241 




-80 




129 




121 






86 




-206 




-80 


306 




4 




-113 






2 




-39 




129 




4 


394 




-19 




-219 




84 




121 




113 




-19 




119 




15 




94 




-86 




2 




-219 




15 




184 




■ 1 


2 


3 
































2 
































1 








4 

































































4 











8 




































































































































































. 


























2 




10 



The corresponding fragment of the table matrices_in is illustrated in Table 1. 



matricesjn 


idSn 


elementsSn 


dimension 


test 


sparse 
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11, 10, 9, ...,1,1, 1,0,-1 


11 X 10 
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Table 1. Matrices in the database table matrices Sn 



Three different matrices are stored in Table 1. The matrix -A, which represents the test matrix Anxio 
from j28j , is accommodated in the first row of the table. The representation of the matrix-B is placed 
in its second row. In the third, fourth and fifth rows is the sparse matrix representation for matrix JJ 
in the COO format. 

The adjoint fragment of the table matrices_out is illustrated by Table 2, where A{~\) and A{MN) 
denote the inverse and the weighted Moore-Penrose inverse, respectively. 
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Table 2. Matrices in database table matrices ^out 
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4.1 Relations between matrix representations and matrix storage systems 

We present different metliods for computing tlie pseudo-inverse ^jy/ ^^'^ executing fundamental ma- 
trix operations, depending on tlie relationship between the implemented storage systems and matrix 
representations. As we have already mentioned, matrices are stored in the database in three formats; 
the R format, the mR format and the COO format (only for the sparse matrix). 

To accelerate the search of the resulting matrices stored in the database as well as to improve the 
pseudo-inverses computation, we exploit correlations between the presented storage systems indicated 
in Figure O 
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1 D array 
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Figure 5: Relationship between storage systems and matrix representations 



We implemented different computational systems for dense as well as for sparse matrices in order to 
detect the best. For dense matrices we use the computational systems illustrated on Figure [HI a. For 
sparse matrices we use a more suitable computational system, as shown on Figured b. 



MR: ID array 
SS: R formal 



MR: ID array 
SS: mR format 



MR: 2D array 
SS: R format 









computation system 
for denss matrix 


A 




7—^ 



sparse 



MR; 2D array 
SS: COO formal 



computation system 
tor sparse rnatrlx 



MR: 20 array 
SS: R fcmmal 



MR - malm represertation 
SS ' storage system 



Figure 6; The computational systems for dense and sparse matrices 



UDFs for both dense and sparse matrices are implemented in the 2D array format. UDFs for 
sparse matrices are adapted to COO format storing system, opposite to UDFs for dense matrices, 
which are adapted to R format. 
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4.2 Implementation of matrix operations 

Implementation of an arbitrary matrix operation in the database tier can be described by the following 
algorithm. 



Algorithm 4.2 Implementation of an arbitrary matrix operation. 
Require: Input matrices. 

1: Define the input matrices and/or coefficients for processing and select the matrix operation. 

2: Form the strings which define elements of entered matrices, applying Procedure 14.21 for each input 
(dense) matrix. 

3: Search entered matrix in the database performing Procedure 14.31 

4: if the input matrices and required operations exist in the database then 

5: search the solution applying Procedure 14.51 

6: if the solution exists then 

7: goto Step 16 

8: else 

9: goto Step 14 
10: end if 
11: else 

12: Store entered matrix by applying Procedure 14.41 and goto Step 14- 
13: end if 

14: Processing the input matrix executing Procedure 14.81 

15: Store the result performing Procedure 14.71 

16: return the result from the database using Procedure 14.61 



We provide some procedures, applicable to dense matrices, that are used in our implementation. 

Procedure 4.1. Procedure 14.11 defines the input matrix by loading the corresponding txt file. It can 
be used in Step 1 of Algorithm 14.21 

function u p 1 o ad _f i 1 e { ) { 

global Suserfile , $userfile_namc , $ u s c r f i 1 e _s i z c , $ u s c r f i 1 c _t y p c , $archivc_dir ,$WINDIR; 
if ( isset (SWINDIR)) $ u s e r f i 1 e = $ s t r _r c p 1 ac c (" \\ \\" , " \ \" , $ u s e r f i 1 e ) ; 
$filename=bascnamc ( $userfile_name ) ; 

i f ( $ u s e r f i 1 e _ s i z c <= 0) die (" Sfilename is empty."); 
if ( ! ©copy ( $ u s e r f i le ," $ ar chive_dir /$ filename ") ) 
die("Can't copy $userfile_name to $f ilename . " ) ; 
if (! isset ($WINDIR)&&!@unlink( Suserfile )) 
die("Can't delete the file $uscrfilc_name . " ) ; } 

Procedure 4.2. Code used in Step 2 of Algorithm 14.21 for defining the string (vector) which includes 
entries of the dense matrix. 

function make_vector ( $ f i 1 c ) { 

$m=count ($ f i 1 e );// number of rows 
$st r ing =" " ; 
for($i=0;$i<$m; $i++){ 

$file[$i] = rtrim($file[$i])." Sstring 
$stringl = $str_replace (" " ," ," , Sstring ) ; 
$string2=substr ( Sstringl ,0 , strlen ( Sstringl ) 
return $string2;} 



= $string . $file [ $i ] ; } 

— l);//vector of all elements from matrix 



Procedure 4.3. The searching of a dense matrix with randomly chosen elements is implemented by 
the next function. The search of a test matrix is defined by its unique name. 
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function i f _c x i s t ( $ f i 1 c ) { 

$db_link=db_conncct ();// connect on the database 
$m=m( $ fi 1 e );// number of rows 
$n=n ($ fi 1 e );// number of columns 

$dimension=Bm ."x".$n; $str i ng 2=make .vector ( $ f i 1 e ) ; 
$query="select * from MATRICESJN where DIMENSION='$dimension "' ; 
$result=mysql_query ( $ query , $db_link ) ; $sum=0; 
while ( $row=my sql_fet ch_r o w ( Srcsult )){ 

if ($row[l] = =$string2 ){$sum++; break;}} 
if ($sum==l){ 

$id_matrix=$row [0] ; } / / matrix exist 
else 

$ id _mat r ix=f a 1 s c ; / / matrix don't exist 
return $id_matrix;} 

Procedure 4.4. If the input matrix does not exist in the database, we store it applying the next code. 

function cntcr.matrix ( $string2 , Sdimcnsion , $test ) { 
$db_link=db .connect ( ) ; 

$query=" insert into MATRICESJN (ELEMENTS JN .DIMENSION, TEST) 
values('$string2 ','$dimension','$test ')"; 
$result=mysql_query ( $ query ,$db_link);} 

Procedure 4.5. If the entered matrices and operation aheady exist, we search the resuh from the 
database. 

function result_exist (Soperation ,$matrixl ,$matrix2 ,$matrix3 ,$r ,$s ,$p,$q){ 
$db_link = db_conncct ( ) ; 

Squery = "select * from MATRICES.OUT where OPERATION ='$operation ' and 

MATRIXJ='$matrixl ' and MATRIXJI= ' $matrix2 ' and MATRIX_III = ' $matrix3 ' 

and R='$r ' and S = '$s ' and P='$p' and Q='$q "' ; 

$result=mysql_query ( $ query , $db_link ) ; 

$row=mysql_fetch_array (Sresult ); 

if ($row[l] = =NlJLL){ $matrix_out=false ;} 

else{$matrix_out=explode (" ," , $row ["ELEMENTS.OUT" ] ) ;} 
return $matrix_out ; } 

Procedure 4.6. If the result exists in the database table, we display it. 

function p r es e nt e d _r es ul t { $ r es ul t ,$m,$n){ 
for ( $i=0,$z=0;$i<$m; $i++){ 

f or ( $j =0; $j <$n ; $j ++,$z++){ 

$result= round($result[$z] ,3); 

echo "<input name=elements [ ] typc=tcxt size=\"4\" maxlength = \" 5\" 
value =\" $ r es u 1 1 \" c 1 a s s =\" s t y le 2 1 \" readonly =\" readonly \" > ";} 
echo "<br>";} } 

Procedure 4.7. The next code is aimed to store the result of the matrix manipulation. 

function entcr_solution($result ,$operation , $id_ml , $id_m2 , $id_m3 ,$r,$s,$p,$q){ 
$db_link=db .connect ( ) ; 
$string=implode ( Sresult ," ,"); 
$query=" i n s c r t into 

MATRICES.OUT(ELEMENTS.OUT, OPERATION, MATRIXJ , MATRIXJI , MATRIX.III , R, S , P ,Q) 
values ( ' Sstring ' , ' Soperation ' , ' $id_ml ' , ' $id_m2 ' , ' $id_m3 ' , ' $r ' , ' $s ' , ' $p ' , ' $q ' ) " ; 
$ r es u lt=mysql .query ( $ query ,$db_link);} 



4.3 The pseudo-inverse computation 

The Php code which implements the weighted Moore-Penrose inverse A\j according to Algorithm 
12.11 Branching in the code is defined according to the selected storage system for our database table. 
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Procedure 4.8. The main function for computation of the Weighted Moore-Penrose inverse. 

fu net ion Weightedlnverse( SArrayDataMatrixM , SArrayDataMatrixA , SArrayDataMatrixN ) { 
$rows=count ( $ ArrayDataMatrixA ) ; $columns=count ( $ ArrayDataMatrixA [ ] ) ; 
$aa=ithCol ( 1 , SArrayDataMatrixA ) ; 
if (isZero($aa)==0) 

$ar=Transpose ( $aa ) ; 
e 1 s e { 

$ta=Transposc ( $aa ) ; 

$aIb=M ult ip li cat ionM at rices ( $ta , SArrayDataMatrixM ) ; 
$ali=MultiplicationMatrices($aIb ,$aa); 
$inv=In verse ( $ ali ) ; 

$ar=MuItipIic at ionM at rices ( $inv , $alb ) ; 

} 

for ( $i=2;$i<=$columns ; $i++){ 

$ii=ithCol ( $i , $ ArrayDataMatrixA ) ; 
$di=M ult iplic at ionM at rices ( $ar , $ii ); 
$fc=frstiCoI ($i - 1 , SArrayDataMatrixA ) ; 
Srmr^M ult iplic at ionM at rices ($fc ,$di); 
$ ci=SubsMat rices ($ i i ,$mni) ; 
$niml=mMinus( SArrayDataMatrixN , $i ) ; 
$nimlg=In verse ( $niml ) ; 
$Ii=mColumn($ArrayDataMatrixN , $i ) ; 
if ( isZero ( $ci)==0){ 

$nii=mskalar ( SArrayDataMatrixN , $i ) ; 
$dit=Transpose ( $di ) ; 

$dtn=MultiplicationMat rices ( $dit , $niml ) ; 
$dtnd=MultiplicationMatrices ($dtn , $di ) ; 
$dtnd=matNum( Sdtnd ) ; 

$ditl = MultiplicationMatrices($dit , $li ); 
$ I i t =Transpose ( $ 1 i ) ; 

$litdi = MultiplicationMatriccs($lit,$di); 
$ditdi=SumaMatriccs (Sditl , Slitdi ); 
$ditdi=matNum{ Sditdi ) ; 

$nimlli=MultiplicationMatrices ( Snimlg , $li ); 
$litnIi=MultipIicationMatrices ( Slit , Snimlli ) ; 
$litnli=matNuni( Slitnli ) ; 

$lktar = MultiplicationMatrices ( Slit , Sar ) ; 
Sar nklk=M ult ipli cat ion Mat rices (Sfc .Snimlli); 
$novo=Multiplicat ionM at rices (Slktar ,$arnklk); 
$novo=matNum ( Snovo ) ; 

$ s i=$ n ii+Sdtnd — $ di t di — S 1 i t n 1 i+Snovo ; 
$dtnar=MultiplicationMatrices ($dtn,Sar); 
Sdilit=SubsMatrices(Sdtnar , Slktar ); 
Sbit=Scalar Matrix (Sdilit ,l/$si); 

} 

else { 

$cit=Transpose ( Sci ) ; 

$citm=M ult i plicat ionM at rices ( Scit , SArrayDataMatrixM ) ; 
$pom=M ult iplic at ionM at rices (Scitm,Sci); $pom=matNum ( $pom ) ; 
Sbit=ScalarMatrix ( Scitm , 1/ Spom ) ; 

} 

Snimlli=MultiplicationMatrices(Snimlg, Sli ); 
$arai=MultiplicationMatrices (Sar , Sfc ) ; 
$arniml=M ult iplic at ionM at rices (Sarai , Snimlli); 
$pi=Subs Mat rices (Snimlli ,Sarniml); 
Skl=M ult iplic at ionM at rices ( Sdi , Sbit ) ; 
Sarkl=SubsMatrices($ar , Ski ) ; 
Spib=M ult iplicat ionM at r ices (Spi,Sbit); 
Sar=S ubs Mat rices ( Sarkl ,Spib); 
Sar=appRow ( Sar , S bi t ) ; } 
return Sar;} 

Implementation of previous function requires several auxiliary procedures. 
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Function ithCol generates the ith column 04 of A. 

function ithCol( Scolumn , $ ArrayDataMatrixl ) { 

$rowsl=count (SArrayDataMatrixl); $columnsl=count (SArrayDataMatrixl [0] ) ; 
for ( $i=0;$i<$rowsl ; $i++) { 

for ( $j =0;$j<$columnsl ; $j++){ 
i f ( $j+l==$column ) 

SArrayCol [ $i ] [0] = SArrayDataMatrixl [ $i ] [ $j ] ; } } 
return SArrayCol;} 

The submatrix Aj = [ai^...^aj] which contains first j < n columns of the matrix A = A„ = [ai^..._a„] can 
be generated performing the next function: 

fu notion frstiCol (Scolumns , $ArrayDataMatrixl){ 

$rowsl=count (SArrayDataMatrixl); $ columns l=count (SArrayDataMatrixl [0] ) ; 
for($i=0;$i<$rowsl; $i++){ 

for ( $j =0;$j<$columnsl ; $j++){ 
i f ( $j+l<=$columns ) 

SArrayCol [ $i ] [ $j ] = $ ArrayDataMatrixl [ $i ] [ $j ] ; } } 
return SArrayCol;} 

Procedure matNum() converts matrix of dimension 1x1 into the number. 

function matNum ($ArrayDataMatrixl){ 

$rowsl=count ( $ ArrayDataMatrixl ) ; $ columns l=count (SArrayDataMatrixl [0] ) ; 
i f ( $rowsl==l&$columns 1==1) 
return SArrayDataMatrixl [ ] [ ] ; } 

Function isZero() checks whether the parameter is zero matrix or not. 

function isZcro( $ Array DataMatrixl ) { 

$isZcro=0; $rows l=count ($Arr ay DataMatrixl); Scolumns l=count ($Arr ay DataMatrixl [0]); 
for($i=0;$i<$rowsl; Si++){ 

for ( $j =0;$j<$columnsl ; $j++){ 

if (round ($ ArrayDataMatrixl [ $ i ] [ $ j ] , 3 ) ! = ) 
$isZero++; } } 
return SisZero;} 

Function appRow() appends the column vector Y to the matrix X. 

function appRow ($Arr ay DataMatrixl , $ArrayDataMatrix2){ 
$isZero=0; 

$rowsl=count ( $ ArrayDataMatrixl ) ; 
$coIumns2=count ( $ArrayDataMatrix2 [0] ) ; 
for($i=0;$i<$columns2; $i++){ 

SArrayDataMatrixl [Srowsl ] [ $ i ] = $ ArrayDataMatrix2 [0] [ $i ] ; } 
return $ ArrayDataMatrixl ; } 

Finally, the function Inverse() returns the inverse of the input parameter. 

function Inversc($ArrayDataMatrix){ 

$rows=count ( $ ArrayDataMatrix ) ; $columns=count (SArrayDataMatrix [0] ) ; 
$ n ii=mskalar ( $ ArrayDataMatrix , 1 ) ; 
$N=l/$nii; $N=array ( array ( $N )) ; 
for($i=2;$i<=$columns; $i++){ 

$s=mskalar ( SArrayDataMatrix , $i ) ; 

$ n i i=array ( array ( $s ) ) ; 

$ li=mColumn ( $ ArrayDataMatrix , $ i ) ; 

$ I i t=T ran s pose ( $ 1 i ) ; 

$giip=MuItipIi cat ionM at rices ( $Iit ,$N) ; 
$ gi i 1=M ult ip li cat ion M at r i ces ( $giip , $Ii ); 
$gii=SubsMatriccs($nii , $giil ); 
$gii2=matNum( $gii ) ; $ g i i =1/ $ g i i 2 ; 
$fip=MuItipIicationMatrices ($N, $li ); 
$fil=Scalar Matrix ( $fip , $gii ) ; 
$fi=ScalarMatrix ( $fil ,-1); 
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$ f i t=Transposc ( $ f i ) ; 

$El=MultiplicationMatrices ( $fi , $fit ); 
$E2=ScalarMatrix ($E1,1/ $gii ); 
$E=SumaMatrices ( $N , $E2 ) ; 
$ArrayDataMatrixl=appRow($E , $fit ); 
$gi i n=arr ay (array( $gii )) ; 
$ ArrayDataMatrix2=appRow ( $fi , $giin ) ; 

$N=finaly ( SArrayDataMatrixl , $ArrayDataMatrix2 ) ; } 
return $N;} 



5 Examples 

Example 5.1. In the next table, we compare the CPU time spent for searching the matrices stored in 
the database by two different storage systems. 



Search of stored matrices 


number of matrices 


R format 


rriR format 


50 


0.102 sec. 


0.647 sec. 


100 


0.166 sec. 


0.660 sec. 


500 


0.7909 sec. 


14.079 see. 


1000 


1.901 sec. 


28.650 sec. 



Table 3. The CPU time for searching matrices dimension 70 X 70 in the R and mR format. 

This testing is performed for dense matrices with randomly chosen elements, from Table 3. we conclude 
that the CPU time for searching is much shorter when the R format is used. 

Let us mention that the searching of the test matrices from |28| is based on the search by a unique 
name, and does not depend on the number of the matrices stored in the database. 

Example 5.2. In Table 4- Qi'^e arranged CPU times required for computation of the pseudo-inverse 
A\^_^. Matrices are represented in two different ways: as ID arrays based on the mR and R format, 
or in the form of 2D arrays combined with the R format storage system. 





matrix representation: storage system 


m y. n 


ID array: niB. format 


ID array: R format 


2D array: R format 


A.30.3 


4.485 sec. 


2.440 sec. 


0.897 sec. 


A.50.4 


32.834 sec. 


21.270 sec. 


6.199 sec. 


F.15.2 


0.666 sec. 


0.342 sec. 


0.166 sec. 


F.30.3 


4.379 sec. 


2.347 sec. 


0.880 sec. 


F.50.4 


32.564 sec. 


20.981 sec. 


6.077 sec. 


S.50.4 


32.597 sec. 


20.874 sec. 


6.070 sec. 


S.80.5 


269.282 sec. 


206.007 sec. 


40.75 sec. 


50 X 50 


32.423 sec. 


21.294 sec. 


6.203 sec. 


15 X 15 


0.632 sec. 


0.471 sec. 


0.160 sec. 


50 X 35 


9.79 sec. 


5.587 sec. 


1.997 sec. 


45 X 70 


125.18 sec. 


86.210 sec. 


19.920 sec. 


60 X 60 


70.260 sec. 


49.964 sec. 


12.627 sec. 



Table 4. Computation time for A^J^Jpf when matrices don't exist in the database. 



Results of the testing show that the best processing time is achieved in the case when matrices are 
presented in the form of 2D arrays. 

Example 5.3. In this example, we compare the CPU times obtained during the computation of the 
pseudo-inverse A\j^ (in the case when the input matrices are not stored in the database) with the CPU 
times required for searching the same results (in the case when the input matrices are stored in the 
database). Matrices are presented in 2D array representation, R format storage system. 
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matrix representation: storage system 


m X n 


2D array : B.f ormat (exist) 


2D array : Rf ormat (don't exist) 


20 X 20 


0.043 sec. 


0.309 sec. 


30 X 30 


0.051 sec. 


2.105 sec. 


45 X 45 


0.062 sec. 


4.665 sec. 


50 X 50 


0.070 sec. 


6.729 sec. 


60 X 60 


0.075 see. 


12.675 sec. 


70 X 70 


0.079 sec. 


26.614 sec. 


80 X 80 


0.096 sec. 


43.809 sec. 



Table 5. The CPU time for A^j^^j^ when matrices exist and don't exist in the database. 

The CPU time required for searching the solution from the database is negligible with respect to the time 
needed for the recalculation, especially if the matrix has larger dimensions. Let us mention that 10030 
records are stored in the table matrices J,n at the moment of testing. 

Example 5.4. Comparison of the CPU time for executing fundamental matrix operations is given in 
Table 6. 







Fundamental 


operation on matrices 




m 


X n 


operation 


ID format: 


BJ ormat 


2D array: Rf ormat 


3x3 


3x3 


multiplication 


0.0468 


sec. 


0.0043 see. 


5x5 


5x5 


multiplication 


0.0481 


sec. 


0.0042 sec. 


10 X 10 


10 x 10 


multiplication 


0.0643 


sec. 


0.0062 sec. 


20 X 50 


50 X 50 


multiplication 


0.1531 


sec. 


0.0778 sec. 


45 X 45 


45 X 70 


multiplication 


0.5618 


sec. 


0.1369 sec. 


80 X 80 


80 X 60 


multiplication 


0.9325 


sec. 


0.7270 sec. 


80 X 70 


70 X 70 


multiplication 


0.9675 


sec. 


0.5600 sec. 


81 X 81 


81 X 81 


multiplication 


1.2831 


sec. 


0.7470 sec. 


3x3 


3x3 


addition 


0.0425 


sec. 


0.0040 sec. 


5x5 


5x5 


addition 


0.0431 


sec. 


0.0043 sec. 


10 x 10 


10 X 10 


addition 


0.0431 


sec. 


0.0048 sec. 


50 X 50 


50 X 50 


addition 


0.1400 


sec. 


0.0326 sec. 


60 X 60 


60 X 60 


addition 


0.2012 


sec. 


0.0480 sec. 


70 X 70 


70 X 70 


addition 


0.2481 


sec. 


0.0705 sec. 


80 X 80 


80 X 80 


addition 


0.3287 


sec. 


0.0919 sec. 


3x3 


3x3 


substraction 


0.0475 


sec. 


0.0041 sec. 


5x5 


5x5 


substraction 


0.0512 


sec. 


0.0044 sec. 


10 X 10 


10 X 10 


substraction 


0.0350 


sec. 


0.0050 sec. 


50 X 50 


50 X 50 


substraction 


0.1431 


sec. 


0.0319 sec. 


60 X 60 


60 X 60 


substraction 


0.1875 


sec. 


0.0733 sec. 


70 X 70 


70 X 70 


substraction 


0.2512 


sec. 


0.0415 sec. 


81 X 81 


81 X 81 


substraction 


0.3150 


sec. 


0.1196 sec. 



Table 6. The CPU time for ID array and 2D array in R format processing. 



The arranged results show that the smallest computational time is obtained in the case when the matrix 
is presented in the format of 2D array, R format storage system. 

Testing was done on the local machine and from client in a wireless network. Wc had an access to 
the web server using the infrastructure mode wireless networking with an access point. Testing was 
executed on the server machine with: Windows edition: Windows Vista^'^^^'^ Ultimate; Processor: 
Intel{R) Pentium{R) Dual CPU T3200 @ 2.00GHz; Memory (RAM) : 2M0MB; System type: 
32 - bit Operating System; Free Software: WAMP 5 1.7.4 installs: PHP 5.2.5, Apache 2.2.6 Server, 
MySQL 5.0.45 and phpMyAdmin 2.11.2.1. 

The CPU time shows that searching of a matrix which is given in the R format gives better results 
than the searching of the same matrix presented in the mR format. Furthermore, the best results 
are obtained in the case when matrices are represented in view of the two-dimensional arrays (on the 
middle tier), application logic adapted to the R format, and matrices are stored in the R format in the 
database. Searching time is inversely proportional with the number of data transferred between the 
middle of the application tier and the database tier. 
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6 Conclusions 

This research contributes to the development of an OSS apphcation and construction of the matrix 
hbrary, especially in computation of the generalized inverses. Several storage technics are discussed 
and tested in the PHP/MySQL environment. Data's preparation as an important preprocessing step 
of data mining is a further application for database matrix computations. There are several issues for 
future work. We plan to develop mechanisms which decrease memory usage and minimize the searching 
time in accordance with different storage systems. Furthermore, it is possible to adapt UDFs to 
sparse structured matrices, such as diagonal matrices, sparse symmetric matrices, triangular matrices, 
Toeplitz matrices, etc. We need to identify other mathematical operations with wide applicability that 
can be implemented with the PHP/MySQL environment, thereby enhancing the DBMS data mining 
functionality. Also, further implementation of the matrix library will be based on the principles of 
Object-Oriented Programming . 
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